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180 



214 PQEEPVYISPQPRSLKENLSPMTPGNLLDVIDNLRFTPLTETRGKGQATIFPDNLAAWPT 273 

181 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 240 

| | 1 | | | | | M I I I 1 I I I I I I I M I I I I I I M I 1 I I I I I M I I I I I I I I I I I I M 1 I I M I 
274 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 333 

241 PLNKTTTIVHATHTRALACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 300 

| | | | | | M I I I I I I I I II II I I I III I I I I I M I I I I I I M 

334 PLNKTTTIVHATHTRAIACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 393 

301 LSLLSPQTKYAIQGSMPNLNEMKIRSIEQNRYYQEQQIQIKAKDLNSSSSSEASLAGQQE 360 

| | I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

394 LSLLS PQTKYAIQGSMPNLNEMKI RS I EQNRYYQEQQIQI KAKDLNSS S S SEASLAGQQE 453 

361 FLFNHSEI LAQS S RFNLHEVGRKSVKGS PVKNPHKRRSHELS FS DAPSNESLYRNETVAI 420 

| | M I I II I I II I I I I I I I I I I I M II II II M I I I I I M I I I I I I I I I I I I I I I I I I I I 
454 FLFNHSEILAQSSRFNLHEVGRKSVKGSPVKNPHKRRSHELSFSDAPSNESLYRNETVAI 513 

421 SPPKKQRVF1DTTLPRSAAPANASARSSSAHAWPHAQSKKFKIAQTMSLMKKPATPRKVRD 480 

| | | | I M M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I M I M I I I I I I I 

514 SPPKKQRVEDTTLPRSAAPANASARSSSAHAWPHAQSKKFKLAQTMSLMKKPATPRKVRD 573 

481 TSIQPSVKLYDSELYMQTCINPDPFAATTTIDPFLASTMYLDEQAVDRHQADFKKWLNAL 540 

| | M | I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
574 TSIQPSVKLYDSELYMQTCINPDPFAATTTIDPFLASTMYLDEQAVDRHQADFKKWLNAL 633 

541 VSIPADLDADLNNKIDVGKLFNEVRNKELWAPTKEEQSMNYLTKYRLETLRKAAVELFF 600 

| | | | | | | I I I I I I I I I I II II I I I II I I I II I I I I II I II I I I I I I M I I I I I I I I I M I 
634 VSIPADLDADLNNKIDVGKLFNEVRNKELWAPTKEEQSMNYLTKYRLETLRKAAVELFF 693 

601 SEQMRLPCSKVAVYVNKQALRIRSDRNLHLDVVMQRTILELLLCFNPLWLRLGLEWFGE 660 

| II M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I II I I I I I II I I I 
694 SEQMRLPCSKVAVYVNKQALRIRSDRNLHLDVVMQRTILELLLCFNPLWLRLGLEWFGE 753 

661 KIQMQSNRDIVGLSTFILNRLFRNKCEEQRYSKAYTLTEEYAETIKKHSLQKILFLLPFL 720 

| | | | | | | | I I I I I ! I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I II 
754 KIQMQSNRDIVGLSTFILNRLFRNKCEEQRYSKAYTLTEEYAETIKKHSLQKILFLLLFL 813 

721 DQAKQKRIVKHNPCLFVKKS PHKETKDI LLRFS SELLANI GDITRELRRLGYVLQHRQTF 780 

| | M I I I I I I I I I I M I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
814 DQAKQKRIVKHNPCLFVKKS PHKETKDI LLRFS SELLANI GDITRELRRLGYVLQHRQTF 873 

781 LDEFDYAFNNLAVDLRDGVRLTRWEVILLRDDLTRQLRVPAI SRLQRI FNVKLALGALG 840 

| | I I I II I I I I I I I I M I I I I M I : I I I I I I I I I I I I I I I I I I I I I I I I I I 

874 LDEFDYAFNNLAVDLRDGVRLTRVMEVILLRDDLTRQLRVPAI SRLQRI FNVKLALGALG 933 

841 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 900 

| | | I I I II I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I 
934 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 993 

901 RRIRHKELMRRHRAATVIQAVFRGHQMRKYVKLFKTERTQAAIILQKFTRRYLAQKQLYQ 960 

M M I I I I I I I I I I I I II I I I I I I I I M I I II M II I I I I I I I I I I I I I M I I I I I I M I 

994 RRIRHKELMRRHRAATVIQAVFRGHQMRKYVKLFKTERTQAAIILQKFTRRYLAQKQLYQ 1053 

961 SYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRRLFAKKLLAAAETARLQRS 1020 

| | | | || | || | | | | | I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1054 SYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRRLFAKKLLAAAETARLQRS 1113 



1021 QKQQAAASYIQMQWRTYQLGRIQRHEFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 1080 

I I I I I ! I I I I : I I I I I I I I = I I 1 1 I I I 1 I I 1 1 I 1 I I I I I 1 t I 1 I I 1 I I I I 1 1 1 I 

1114 QKQQAAASYIQMQWRSYQLGRIQRQQFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 1173 

1081 AINI QQRWRAKLSMRKCNADYLALRS S VLKVQAYRKAT I QMRI DRNHYYS LRKNVI CLQQ 1140 

| | I I I I I I I I I I I I I I I I I I ! I I I I I I I 1 I II I I I I I I I I I M I I 1 I I I I I I I I I I I M I 
1174 AINI QQRWRAKLSMRKCNAD YLALRS SVLKVQAYRKAT I QMRI DRNHYYS LRKNVI CLQQ 1233 

1141 RLRAIMKMREQRENYLRLRNASILVQKRYRMRQQMIQDRNAYLRTRKCIINVQRRWRATL 1200 

| | | I I I I I I I I I I I I I I II I I I I I I I I I I M I I M I I I I I I I II I I I I I I I II I 

1234 RLRAIMKMREQRENYLRLRNASILVQKRYRMRQQMIQDRNAYLRTRKCIINVQRRWRATL 1293 

12 01 QMRRERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 1260 

| | I I M I I I I II I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

1294 QMRRERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 1353 

12 61 QEYLHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFLQL 1320 

| | | | | I I I I I II I I II I I I I I II II I I I I II I I I I I I I I M I I I II I I I I I I I I I M I I I 
1354 QEYLHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFLQL 1413 

1321 RQAAITLQRRYRARLNMIKQLKSYAQLKQAAITIQTRYRAKKAMQKQWLYQKQREAIIK 1380 

| | M I I I I M I I I II I I I I I M I M I M I I II II I I I I I I I I I I I I I I I I I I I M II I II 

1414 RQAAITLQRRYRARLNMIKQLKSYAQLKQAAITIQTRYRAKKAMQKQWLYQKQREAIIK 1473 

1381 VQRRYRGNLEMRKQIEVYQKQRQAVIRLQKWWRSIRDMRLCKAGYRRIRLSSLSIQRKWR 1440 

|| || || | | | | | | | I I I I I I I II II I I I I I I M I I I I I I I I I I I I I I I M I I M I I I I I I I 
1474 VQRRYRGNLEMRKQI EVYQKQRQAVI RLQKWWRS I RDMRLCKAGYRRI RLS SLS IQRKWR 1533 

1441 ATVQARRQREIFLSTIRKVRLMQAFIRATLLMRQQRREFEMKRRAAWIQRRFRARCAML 1500 

| | | I I I I I I I I I I I II I I I I I I I II I I I I I I II II I I M I I I I M 

1534 ATVQARRQREIFLSTIRKVRLMQAFIRATLLMRQQRREFEMKRRAAWIQRRFRARCAML 1593 

1501 KARQDYQLIQSSVILVQRKFRANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMIEQRNCF 1560 

| | M I I I I I I I I I I I I I I I I I I I I M I I M I I I M I I N I I I I I I I I I I II I I I I I I I M 
1594 KARQDYQLIQSSVILVQRKFRANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMIEQRNCF 1653 

1561 QLLRCSMPGFQARARGFMARKRFQAIKTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQ 1620 

| | M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

1654 QLLRCSMPGFQARARGFKARKRFQALMTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQ 1713 

1621 GLLDIRKRIAQLRQEAKAVNSVRCKVQEAVRFLRGRFIASDALAVLSQLDRLSRTVPHLL 1680 

MM | MM I Ml I I I MINI II I I II M I MM III M Mil I Ihll M I I HUM 
1714 GLLDIRKRIAQLRQEAKAVNSVRCKVQEAVRFLRGRFIASDALAVLSRLDRLSRTVPHLL 1773 

1681 MWCSEFMSTFCYGIMAQAIRSEVDKQLIERCSRIILNLARYNSTTWTFQEGGLVTIAQM 1740 

M I I I I I I M I I I M I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I M I I I I I I 

1774 MWCSEFMSTFCYGIMAQAI RSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQM 1833 

1741 LLRWCDKDSEIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1800 

| | | | | | | | | I M I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I M I 
1834 LLRWCDKDSEIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1893 

1801 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1860 

M II I II I II I I M M II II H M II I I H I I I M I M I II II II M M MM 

1894 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1953 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Ru n on: February 22, 2005, 23:31:55 ; Search time. 61 Seconds 

(without alignments) 
2935.400 Million cell updates/sec 



Title: US-09-914-698-1 
Perfect score: 9514 
Sequence: 



9514 

1 MELVWSPVLEVACKETLQLI FI S S VYAFDT I LCKLQI DMF 1861 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283416 



Database 



PIRJ79:* 
pirl : * 
pir2:* 
pir3:* 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 

mxcrotubule-associated protein - fruit fly (Drosophila melanogaster ) 
f'Snecies: Drosophila melanogaster onn/i 

ciDate: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 09-Jul-2004 

Kis^dSrR'D'rLides, M.C., Howard, T.; Gonzalez, C; Glover, D.M. 

AiT^le.^-DrrsUila 8 ^ 1 ^^! spindle encodes a microtubule- -associated 
protein that associates with the polar regions of the natotxc spindle. 
A;Reference number: Z17792; MUID: 97296495; PMID:9151690 

A;Accession: T13845 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 

^^"-"iaiiicS^^RDTrOOWOl, EMBL:U95171; NID: gl930121; PID:gl930122; 
PIDN:AAB51540. 1 



C; Genetics : 
A; Gene: asp 

A; Cross-references : FlyBase: FBgn0000140 
C; Function: 

A; Description: is required for the normal function of the mitotic spindle 

Query Match 100.0%; Score 9514; DB 2; Length 1861; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1861; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

■qv 1 MELVWSPVLEVACKETLQLIDNRNFRKEVMIILKSKSNQPVKNPRKFPTVGKTLQLKSPT 60 

| 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db i MELVWSPVLEVACKETLQLIDNRNFRKEVMIILKSKSNQPVKNPRKFPTVGKTLQLKSPT 60 

QV 61 GAGKTMKS WS AAVQQKKRMS AAAAP P S KQTWRVTAP S RPAAWAH P P PQAP LVEKNVYKT 120 

| | | | | | | M | | I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I 
Db 61 GAGKTMKS WS AAVQQKKRMS AAAAP P S KQTWRVTAP S RPAAWAH P P PQAP LVEKNVYKT 120 

Qv 121 PQEEPVYI S PQPRS LKENLSPMTPGNLLDVI DNLRFTPLTETRGKGQATI FPDNLAAWPT 180 

| | | | || | | || | | | | I I I I I I I I I I I I I I II I I I I I I II I I I I I I I M I I I II I I I I I I I I 
Db 121 PQEEPVYI S PQPRS LKENLSPMTPGNLLDVI DNLRFTPLTETRGKGQATI FPDNLAAWPT 180 

Ov 181 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 240 

| | | [ | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 240 

0v 241 PLNKTTTIVHATHTRALACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 300 

| | | | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 PLNKTTTIVHATHTRALACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 300 

Ov 301 LSLLSPQTKYAIQGSMPNLNEMKIRSIEQNRYYQEQQIQIKAKDLNSSSSSEASLAGQQE 360 

| | | | | | | | | | | I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LSLLSPQTKYAIQGSMPNLNEMKIRSIEQNRYYQEQQIQIKAKDLNSSSSSEASLAGQQE 



Db 301 



360 
420 



Ov 361 FLFNHSEILAQSSRFNLHEVGRKSVKGSPVKNPHKRRSHELSFSDAPSNESLYRNETVAI 

y | | I I I I I I I I I I I I I I I I I I I I I I I II INI II II II I I INI Mill I II II II 

Db 361 FLFNHSEILAQSSRFNLHEVGRKSVKGSPVKNPHKRRSHELSFSDAPSNESLYRNETVAI 420 

Ov 421 SPPKKQRVEDTTLPRSAAPANASARSSSAHAWPHAQSKKFKLAQTMSLMKKPATPRKVRD 480 

| | | | | | | | | | | | I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 421 SPPKKQRVEDTTLPRSAAPANASARSSSAHAWPHAQSKKFKLAQTMSLMKKPATPRKVRD 480 

0 481 TSIQPSVKLYDSELYMQTCINPDPFAATTTIDPFLASTMYLDEQAVDRHQADFKKWLNAL 54 0 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 481 TSIQPSVKLYDSELYMQTCINPDPFAATTTIDPFLASTMYLDEQAVDRHQADFKKWLNAL 540 



Ov 541 VSIPADLDADLNNKIDVGKLFNEVRNKELWAPTKEEQSMNYLTKYRLETLRKAAVELFF 

| | | | | | | | | | | | | I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 VSIPADLDADLNNKIDVGKLFNEVRNKELWAPTKEEQSMNYLTKYRLETLRKAAVELFF 



600 
600 
660 



0v 601 SEQMRLPCSKVAVYVNKQALRIRSDRNLHLDWMQRTILELLLCFNPLWLRLGLEWFGE 

I I I I | I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I N I I I I I I I I I I I I I I I I I I 
Db 601 SEQMRLPCSKVAVYVNKQALRIRSDRNLHLDWMQRTILELLLCFNPLWLRLGLEWFGE 660 

Ov 661 KIQMQSNRDIVGLSTFILNRLFRNKCEEQRYSKAYTLTEEYAETIKKHSLQKILFLLPFL 720 

|| | MM I III II I IMIIIIII II III III I M III Ml Ml H I I Ml M H MINI 

Db 661 KIQMQSNRDIVGLSTFILNRLFRNKCEEQRYSKAYTLTEEYAETIKKHSLQKILFLLPFL 720 



721 DQAKQKRIVKHNPCLFVKKSPHKETKDILLRFSSELLANIGDITRELRRLGYVLQHRQTF 7 80 

| | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 DQAKQKRIVKHNPCLFVKKSPHKETKDILLRFSSELLANIGDITRELRRLGYVLQHRQTF 780 



781 L DE FD YAFNN LAVDLRDGVRLT RWEVI LLRDD LT RQ LRVP AI S RLQ RI FNVKLAL GALG 

| | | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I i I I M I I I I I I I I I I I I I 

781 LDEFDYAFNNLAVDLRDGVRLTRWEVILLRDDLTRQLRVPAISRLQRIFNVKLALGALG 



840 
840 



841 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 900 

| | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

841 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 900 



901 RRI RHKELMRRHRAATVI QAVFRGHQMRKYVKLFKTERTQAAI I LQKFTRRYLAQKQLYQ 960 

| | | | | M | | | | | | || I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I 
901 RRI RHKELMRRH RAATVI QAVFRGHQMRKYVKLFKTERTQAAI ILQKFTRRYLAQKQLYQ 



961 S YH S 1 1 T I QRWWRAQQLGRQHRQRFVELREAAI FLQRI WRRRLFAKKLLAAAETARLQRS 

| MM HIM MINIMUM I ! II MUM I I I I I I I I I I I I I I I I I I II I 

961 SYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRRLFAKKLLAAAETARLQRS 

1021 QKQQAAASYIQMQWRTYQLGRIQRHEFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I II I I I I I I I I I I I 
1021 QKQQAAASYIQMQWRTYQLGRIQRHEFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 

1081 AINIQQRWRAKLSMRKCNADYLALRS SVLKVQAYRKATIQMRIDRNHYYSLRKNVICLQQ 

M I I I I I I I I M II I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I 

1081 AINIQQRWRAKLSMRKCNADYLALRS SVLKVQAYRKATIQMRIDRNHYYSLRKNVICLQQ 



1141 
1141 
1201 
1201 
1261 
1261 
1321 
1321 
1381 
1381 
1441 



RLRAIMKMREQRENYLRLRNAS I LVQKRYRMRQQMI QDRNAYLRTRKCI INVQRRWRATL 

| | | | | M I I I I M I I M I I I I I I I I I I I I I I I I I I M II I II II I I M I I 

RLRAIMKMREQRENYLRLRNASI LVQKRYRMRQQMI QDRNAYLRTRKCI INVQRRWRATL 

QMRRERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 

| | | | | | M I I I II II I I | I II II I I I I I I I I I I I I II I I I I I II I I I I I I I I I 

QMRRERKNYLHLQTTTKRI QI KFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 

QEYLHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFLQL 

| M || I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I! I I I I I I I I I I I I I 
QEYLHLREVT I KLQRRFHAQKSMRFMRAKYRGTQAAVS CLQMHWRNHLLRKRERNS FLQL 

RQAAITLQRRYRARLNMIKQLKS YAQLKQAAITI QTRYRAKKAMQKQWLYQKQREAI I K 1380 

MM I I t I I I 1 I I I MMIIIIIIII I MM 

RQAAITLQRRYRARLNMIKQLKS YAQLKQAAITIQTRYRAKKAMQKQWLYQKQREAI IK 1380 



960 
1020 
1020 
1080 
1080 
1140 
1140 
1200 
1200 
1260 
1260 
1320 
1320 



VQRRYRGNLEMRKQI EVYQKQRQAVI RLQKWWRS I RDMRLCKAGYRRI RLSSLS IQRKWR 

M I I I I I I II II I M II I I II II I II M II M II I II II M I M I M II 

VQRRYRGNLEMRKQI EVYQKQRQAVI RLQKWWRS IRDMRLCKAGYRRIRLSSLSIQRKWR 



ATVQARRQREIFLSTIRKVRLMQAFIRATLLMRQQRREFEMKRRAAWIQRRFRARCAML 

M | I I I II I I I I I I II I I II I I M I M I I I I I I I M I I I M I I I M I I I M I I M I II I I 
1441 AT VQ ARRQ REI FL S T I RKVRLMQ AF I RAT LLMRQQRRE FEMK RRAAWI Q RR FRARCAML 



1440 
1440 
1500 
1500 



1501 KARQDYQLIQSSVILVQRKFRANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMI EQRNCF 1560 

Mill Mill III III II M I II I M I I M M M M M I I II I I I II I I M I II I 

1501 KARQDYQLIQSSVILVQRKFRANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMI EQRNCF 1560 



Qv 1561 QLLRCSMPGFQARARGFMARKRFQAIjMTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQ 1620 

| | | | I | I I | I I I I I I I I I I I I I 1 Ill IIIIIIIIMIIMIIMM !! I INI 

Db 1561 QLLRCSMPGFQARARGFMARKRFQALMTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQ 1620 

Ov 1621 GLLDIRKRIAQLRQEAKAVNSVRCKVQEAVRFLRGRFIASDALAVLSQLDRLSRTVPHLL 1680 

| | | | | | | | | | | I I I I | | I | | | I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 GLLDIRKRIAQLRQEAKAVNSVRCKVQEAVRFLRGRFIASDALAVLSQLDRLSRTVPHLL 1680 

Ov 1681 MWCSEFMSTFCYGIMAQAIRSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQM 1740 

I | 1 | | | | | I I I t I I I I I I I I I 1 I 1 I I 1 I t I I 1 1 I I I I 1 I I I I 1 I I 1 MMIIII 

Db 1681 MWCSEFMSTFCYGIMAQAI RSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQM 1740 

Qv 1741 LLRWCDKDSEIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1800 

| | | | M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I Mill 

Db 1741 LLRWCDKDSEIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1800 

Ov 1801 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1860 

1 | 1 1 | | 1 | I I I I I I 1 1 I 1 | I I 1 1 I I I 1 I I I I I I I I 1 I 1 I I I I I I 1 "Ill 

Db 1801 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1860 

Qy 1861 F 1861 

I \ 
Db 1861 F 1861 

RESULT 2 
T19957 

hypothetical protein C45G3.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T19957 
R;Barlow, K. 

submitted to the EMBL Data Library, March 1997 
A;Reference number: Z19203 
A;Accession: T19957 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;^"-::™!: < Sp R OT:017666 i HB..OM7174.1, G S PD BiS N000 19 

CESP:C45G3.1 

A; Experimental source: clone C45G3 

C; Genetics : 

A; Gene: CESP:C45G3.1 

A;Map position: 1 

A;Introns: 21/1; 255/2; 363/2; 575/3; 893/3; 1017/2; 1042/1 
C;Superfamily: Caenorhabditis elegans hypothetical protein C45G3.1 

Query Match 3.8%; Score 361.5; DB 2; Length 1186; 

Best Local Similarity 18.6%; Pred. No. 6-5e-12; 

Matches 244; Conservative 216; Mismatches 480; Indels 375, Gaps 47, 

Ov 458 KKFKLAQTMSLMKK PATPRKVRDTSIQPSVKLYDSELYMQTCINPDPFAATTTIDPF 514 

Y • I I | | | | III : : I : : I : I : I I : : : : I 

Db 21 EKRLLDQVKSNTKKI DLRATERAFLES S PTSMNMRTPLNPS-ISSSVSDSPI 71 

Ov 515 LASTMYLDEQAVDRHQADFKKWLNALVSIPADLDADLNNKIDVGKLFNEV 564 

Y |: | |:| :: I I : :: |:: ::|:|: I 



Qy 

Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 



Db 72 LS YDEKA-NKQIIALATWCNTM MELDVS EEMDLGE S KAEACRNI QKMLKK 120 

Qy 565 RNKELWAPTKEEQSMNY— LTKYRLETLRKAAVELFFSEQMRLPCSKVAVYVNKQALR 621 

|: | |:| I II ::l : I I : : ::l : 

Db 12 i RSDTSEVENTQENARRRYQRIFEKNDPEWKKKCKQLLDDSGMD— ASIKDLLSKNNVA 177 

622 IRSDRNLHLDWMQRTILELLLCFNPLWLRLGLEWFGEKIQMQSNRDIVGLSTFILNRL 681 

||: : : | : : | I : I I I : I I I : I I : I : I I : till:: 
178 IRKEHAVYNDIGLQTTLLHTFLSFHPAWLKTALEAI FNTRIDAQPKHLMKKLSQFFLDLV 237 

682 FRN— KCEEQRYSKAY— TLTEEYAETIKKHSLQKILFLLPFLDQAKQKRIVKHNPCLF 736 

|| :::::: : I I I ' : I I I : I '• "I I : : : : I 

238 FSNPTMLKNKKFAQGSGKPIITEAGKEALHKHFLSVSMKLMFLIETAHTHRVIPNLTRIF 297 

737 VKKS PHKETKDI LLRFS S ELL- ANI GDITRELRRLGYVLQHRQT FLDEFDYAFNNLAVDL 795 

|| |: : III : : :ll:l- HI : I 

298 TKSSHFNCLDDVFSELTKELLTGSSATFKKAFAKVGFIPTYRQSFIENYDYQAKGFS-DF 356 

7 96 RDGVRLTRWEVI— LLRDDLTRQLRVPAISRLQRIFNVKLALGALGEANFQLG— GDI 850 

||: | :::| : : : : :::| III: I : I I :: 

357 S DGLI LAKLLETVGEMPHGQI LLKLRDPAGDRI RKI GNVKI VLQEMS SLGVPTDNV 412 

Qy 851 AAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQRRIR H 905 

Db 413 NAESIVGGKKDEILSILWAII GVRVAKEQRI KVTRVSE 450 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 656 

Ov 1164 LVQKRYRMRQQMIQDRNAYLRTRKCIINVQRRWRATLQMRRERKNYLHLQTTTKRIQIKF 1223 

|:| I I : : I: I :| : I :: 

Db 708 REQDEDDEN QDKDTVAPSAE NLDSPPSDIPLET 740 

Qy 1224 RAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKERQEYLHLREVTIKLQRRFHAQKSM 1283 

| | | | | : 'I 
Db 741 LSSIPSASQSAIFLQDSE TGKEM 763 

ov 1284 RFMRAKYRGT— QAAVSCLQMHWRNHLLRKRERNSFLQLRQAAITLQRRYRARLNMIKQ 1340 

Y : | : | : | : I : : I :|: I: II' 

Db 764 HVPKAEDVGWVLEASDSPVALEGNN EASYDGQKIENLETFEIKE 808 



906 KELMRRHRAA TVIQAVFRGHQMRKYVKLFKTERTQAAIILQKFTRRYLAQKQ 957 

: : : | : | : I I I : : : s : ' ' : 

451 ERTPKKRRSAVHDDMSSEVLKMCKIYGRQME— IEVMDLDSLSDGCLLAKLWTTFGTNST 508 

958 LYQSYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQR— IWRRRLFAKKLLAAAETA 1015 

| | , | :: | : I : :M II I I 
509 PIQDYDG— LSLW EKWSVAELELCIQRGLDQNMALFVKMFL E 54 9 

1016 RLQRSQKQQAAASYIQMQWRTYQLGRIQRHE FL RQRDLIMF 1056 

|| | I : II I : I • M 1 : 11:1 

550 RLGMIQDLNEKATKIQRMWKAY VQRKNTPKLYFIVQQLLADSSIPRNRSVSPFSNN 605 

10 57 VQRRMRSKWSMLEQRKEFQQLKRAAINIQQRWRAKLSMRKCNADYLALRSSVLKVQ 1112 

| | | : : : I : I I : : I : I : I I = : 

606 VTFTVPRTPRN — NILTERPSLSQIPSS RQSMDSTFNDATFTVSRDSIESMN 655 

1113 AYRKA T IQMRI DRNHYYSLRKNVI CLQQRLRAIMKMREQRENYLRLRNAS I 1163 

: | MM: : I : : I : : : I I : 
KMQKTPLRGTFTRKTIAMVIEEEDDSENNETWPSTLKKRTWRMEHNAEVF 707 



Qy 1341 LKS YAQLKQAAITIQTRYRAKKAMQKQWLYQKQREAI I KVQRRYRGNLEMRKQI EVYQK 1400 

|: | : i: :: :: |:: I : : : I M : 

Db 809 GKTQEDLPSKSPMDPTQTSGSPLVEFRMTEEQERLEMLFQSLSEDQKNFVKTNNLSVSIE 868 

Qy 1401 QRQAVI RLQKWWRS I RDMRLCKAGYRRI RLS SLS IQRKWRATVQARRQREI FLST I RKVR 1460 

Db 869 DDANTPELRRILRQTRELK RKQQEI AR 895 

0v 1461 LMQAFIRATLLMRQQRREFEMKRRAAWIQRRFRARCAMLKARQDYQLIQSSVILVQRKF 1520 

| | :| : II : I II ::|: I h 

Db 896 KLGNIERNALAVRDGGEDSSDSRSDA GHDVAI LHGDDSQLFENSMQLDQK 945 

Qy 1521 RANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMI EQRNCFQLLRCSMPGFQARARGFMAR 1580 

Db 946 945 

Qy 1581 KRFQALMTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQGLLDIRKRIAQ LRQEA 1636 

| | | :|:|| III: II:: II: : : : :ll Is I Ml 
Db 946 SQLQNDETQI LENKKKAAWI QKMI RGFI ARRKFQME- 1 SNI RNRMI QYNH I LAQED 1001 

Ov 1637 KAV NSVRCKVQEAVRFLRGRFIASDALAVL SQLDRLSRTVPHLL 1680 

Y : : Mi::: II : : I I I : : :||:: MM 

Db 1002 EQI GI EEMEDKS VEAKLKKCA — LHG — LTNDNLHWHVAATVI DRVTDLVP SLL 1052 

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 

OM protein - protein search, using sw model 

Run on: February 22, 2005, 23:22:29 ; Search time 244 Seconds 

(without alignments) 
3905.653 Million cell updates/sec 

Title: US-09-914-698-1 
Perfect score: 9514 

Sequence: 1 MELVWSPVLEVACKETLQLI FI S SVYAFDTI LCKLQI DMF .1861 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1612378 seqs, 512079187 residues 

Total number of hits satisfying chosen parameters: 1612378 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_03:* 

1 : uniprot_sprot : * 
2 : uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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RESULT 1 
AS P_DROME 

ID ASP_DROME STANDARD; PRT; 1954 AA. 

AC Q9VC45; 001401; Q8SX66; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-JAN-2005 (Rel. 46, Last annotation update) 

DE Abnormal spindle protein. 

GN Name=asp; ORFNames=CG687 5 ; 

OS Drosophila melanogaster (Fruit fly). 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 
OC Ephydroidea; Drosophilidae; Drosophila. 
OX NCBI_TaxID=7227 ; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=107 31132 ; DOI=10. 1126/science . 287 . 5461 . 2185 ; 
RA Adams M.D., Celniker S.E., Holt R.A. , Evans C.A., Gocayne J.D., 
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 
RA George R.A., Lewis S.E., Richards S., Ashburner M. , Henderson S.N., 
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q. , Chen L.X., 
RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champe M., Pfeiffer B.D., 
RA Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., 
RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 
RA Cherry J.M., Cawley S., Dahlke C, Davenport L.B., Davies P., 
RA de Pablos B. , Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 
RA Dodson K., Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P. 
RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W. 
Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K. , 
Glodek A., Gong F. , Gorrell J.H., Gu Z., Guan P., Harris M., 
Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., 
Hostin D., Houston K.A. , Howland T.J., Wei M.-H., Ibegwam C, 
Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J. A., Ketchum K.A. 
^ Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 
RA Lasko P., Lei Y. , Levitsky A. A. , Li J.H., Li Z . , Liang Y., Lm X., 
RA Liu X., Mattel B., Mcintosh T.C., McLeod M.P., McPherson D., 
RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 
RA Mount S.M., Moy M. , Murphy B., Murphy L. , Muzny D.M., Nelson D.L., 
RA Nelson D.R., Nelson K.A. , Nixon K., Nusskern D.R., Pacleb J.M., 

Palazzolo M. , Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., 
Reinert K., Remington K. , Saunders R.D.C., Scheeler F., Shen H., 
Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., 
Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., 
™ Svirskas R., Tector C, Turner R. , Venter E., Wang A.H., Wang X., 
RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 
RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 
RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G. , Zhao Q., Zheng L., 
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X.,. Smith H.O 
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; ^ 
RT "The genome sequence of Drosophila melanogaster."; 
RL Science 287:2185-2195(2000). 



RA 
RA 
RA 
RA 
RA 
RA 



RA 
RA 
RA 
RA 
RA 



RN [2] 

RP GENOME REANNOTATION. 

RX MEDLINE=22426069; PubMed=12537572 ; 

Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., 
Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., 
Smith CD., Tupy J.L., Whitfield E.J., Bayraktaroglu L. , Berman B.P, 
Bettencourt B.R., Celniker S.E., de Grey A. D . N . J . , Drysdale R.A. , 
Harris N.L., Richter J., Russo S., Schroeder A. J., Shu S.Q., 
Stapleton M. , Yamada C, Ashburner M. , Gelbart W.M., Rubin G.M. , 



"Annotation of the Drosophila melanogaster euchromatic genome: 



a 



RA 
RA 
RA 
RA 
RA 
RA 

RA Lewis S . E. ; 
RT 

RT systematic review. . 

RL Genome Biol. 3 : RESEARCH0083 . 1-RESEARCH0083 . 22 (2002 ) . 

M? SEQUENCE OF 94-1954 FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND 

RP DEVELOPMENTAL STAGE. 

RC STRAIN=Oregon-R; 

RX PubMed=9151690; DOI=10 , 1083/jcb . 137 . 4 . 881; 

RA Saunders R.D.C., do Carmo Avides M. , Howard T.I.A., Gonzalez C, 

RA Glover D.M.; . 

RT "The Drosophila gene abnormal spindle encodes a novel microtubule 

RT associated protein that associates with the polar regions of the 

RT mitotic spindle."; 

RL J. Cell Biol. 137:881-890(1997). 

RN [4] 

RP SEQUENCE OF 700-1954 FROM N.A. 

RC STRAIN=Berkeley; TISSUE=Embryo; 

RX MEDLINE=22426066; PubMed=12537569 ; 

RA Stapleton M., Carlson J.W., Brokstein P., Yu C, Champ e M. , 

RA George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S. f Wan K.H. 

RA Rubin G.M., Celniker S.E.; 
RT 



"A Drosophila full-length cDNA resource."; 

RL Genome Biol. 3 : RESEARCH0080 . 1-RESEARCH0080 .8 (2002) . 

RN [5] 

RP FUNCTION, AND SUBCELLULAR LOCATION. 

RX PubMed=10073938; DOI=10 . 1126/science . 283 . 5408 . 1733 ; 

RA do Carmo Avides M. , Glover D.M. ; 

RT "Abnormal spindle protein, Asp, and the integrity of mitotic 

RT centrosomal microtubule organizing centers."; 

RL Science 283:1733-1735(1999). 

RN [6] 

RP FUNCTION. 

RX PubMed=15242765; DOI=10 . 1016/ j . yexcr . 2004 . 03 . 054 ; 

RA Riparbelli M.G., Massarelli C, Robbins L.G., Callami G. ; 

RT "The abnormal spindle protein is required for germ cell mitosis . and 

RT oocyte differentiation during Drosophila oogenesis."; 

RL Exp. Cell Res. 298:96-106(2004). 

CC FUNCTION: Required to maintain the structure of the centrosomal 

CC ' microtubule-organizing center (MTOC) during mitosis. May have a 

CC preferential role in regulating neurogenesis. Required for germ 

CC cell mitosis and oocyte differentiation. 

CC -'- SUBCELLULAR LOCATION: Cytoplasmic and nuclear. During interphase 

CC in syncytial embryos distribution is cytoplasmic. On entering 

CC mitosis, moves to polar regions of the spindle immediately 

CC surrounding the centrosome. At telophase, migrates to microtubules 

CC on the spindle side of both daughter nuclei. The nuclear- 

CC cytoplasmic distribution could be regulated by the availability of 
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- ! - 

- ! - 
_ i - 



calmodulin. 

DEVELOPMENTAL STAGE: Expressed both maternally and zygotically in 
embryos . 

SIMILARITY: Contains 1 calponin-homology (CH) domain. 
SIMILARITY: Contains 5 IQ domains. 

CAUTION: Ref.4 sequence differs from that shown due to intron 
retention. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on xts 
use by non-profit institutions as long as its content is m no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AE003749; AAF56330.3; -. 

EMBL; U95171; - AAB51540.1; 

EMBL,- AYO 94 8 25; AAM11178.1; ALT_SEQ. 

PIR; T13845; T13845. 

FlyBase; FBgn0000140; asp. 

GO; GO: 0005875; C microtubule associated complex; IDA. 

GO; GO: 0008017; F:microtubule binding; IDA. 

GO; GO: 0004672; F:protein kinase activity; IDA. 

InterPro; IPR001715; Calponin-like . 

InterPro; IPR000048; IQ_region. 

Pfam; PF00307; CH; 1. 

Pfam; PF00612; IQ; 21. 

SMART; SM00033; CH; 1. 

SMART; SM00015; IQ; 16. 

PROSITE; PS50021; CH; 1. 

PROSITE; PS50096; IQ; 5. . 
Calmodulin-binding; Cell division; Coiled coil; Microtubule; Mitosis, 

Nuclear protein; Oogenesis; Repeat. 



DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
CONFLICT 
. CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
SEQUENCE 



836 
1004 
1386 
1467 
1614 
1656 
1690 
1063 

811 

898 
1129 
1138 
1761 
1954 AA; 



968 
1033 
1415 
1496 
1641 
1687 
1721 
1749 
811 
898 
1129 
1139 
1761 



CH 
IQ 1. 
IQ 2. 
IQ 3. 

Coiled coil (Potential) . 
IQ 4. 
IQ 5. 
Arg-rich. 

L -> P (in Ref . 3) . 
M -> V (in Ref. 3) . 
S -> T (in Ref. 3) . 
QQ -> HE (in Ref. 3) . 
R -> Q (in Ref. 1) . 



230178 MW; 4912B4E20CA9E659 CRC64; 
99.7%; Score 9482; DB 1; Length 1954; 



Query Match 
Best Local Similarity 99.7%; Pred. No. 0; 
Matches 1855; Conservative 4; Mismatches 



2; Indels 



0; Gaps 



0; 



3PT 60 



QY 
Db 



1 MELVWS PVLEVACKETLQLI DNRNFRKEVMI I LKSKSNQPVKNPRKFPTVGKTLQLKS I 

, I I I I I I I I I I I I I I I I I 1 I I M M I 1 I 1 I I ! I 1 I I I 1 I I I I I I I I I I 1 ! I I 1 M I I I 1 i 

94 MELVWS PVLEVACKETLQLI DNRNFRKEVMI I LKSKSNQPVKNPRKFPTVGKTLQLKS PT 153 



61 GAGKTMKS WSAAVQQKKRMSAAAAP P S KQTWRVTAP S RPAAWAH PPPQAPLVEKNVYKT 120 

1 1 | I 1 t I 1 I I I I 1 I I I I I I I 1 I I I I I 1 I I 1 I I I I 1 1 I I I t I 1 I I 1 1 1 I I I I I I I 

154 GAGKTMKSWSAAVQQKKRMSAAAAPPSKQTWRWAPSRPAAWAHPPPQAPLVEKNVYKT 213 

121 PQEEPVYISPQPRSLKENLSPMTPGNLLDVIDNLRFTPLTETRGKGQATIFPDNLAAWPT 180 

| | | | | | | | | I M | I I I I II I I I II I I I I I I I I M I I I I I M I I I I M I I I I I I M I I I I I 
214 PQEEPVYISPQPRSLKENLSPMTPGNLLDVIDNLRFTPLTETRGKGQATIFPDNLAAWPT 273 

181 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 240 

M I I I I 1 I I I t 1 I I 1 1 1 I t Illllllllll I I II I I I I I I I I I I 

274 PTLKGNVKSCANDMRPRRITPDDLEDQPATNKTFDVKHSETINISLDTLDCSRIDGQPHT 333 



241 PLNKTTTIVHATHTRALACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 300 

| | | M | | | I I I M I I 1 I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I M M I I I I 
334 PLNKTTTIVHATHTRALACIHEEEGPSPPRTPTKSAIHDLKRDIKLVGSPLRKYSESMKD 



393 



301 LSLLS PQTKYAIQGSMPNLNEMKI RS I EQNRYYQEQQIQI KAKDLNS S S S SEASLAGQQE 360 

| | | | 1 1 | M I I I I I I I I I I I I I I I I I M I I i I I I I I II I I I I I I I I I I M M I I I 

394 LS LLS PQTKYAI QGSMPNLNEMKI RS I EQNRYYQEQQI Q I KAKDLNS S S S S EASLAGQQE 



361 FLFNHSEI LAQS SRFNLHEVGRKSVKGS PVKNPHKRRSHELS FSDAP SNES LYRNETVAI 

I I I I | | M | I M I I I I I M M I I I I I I I I I I M I I I II I I I I I I M I I I I I I I I I I M II 

454 FLFNHSEILAQSSRFNLHEVGRKSVKGSPVKNPHKRRSHELSFSDAPSNESLYRNETVAI 

421 SPPKKQRVEDTTLPRSAAPANASARSSSAHAWPHAQSKKFKIAQTMSLMKKPATPRKVRD 

M I I I I I I I I I 1 I I I I I I I I I I I I I I I 1 I | | I I I I I I I I I I I I I I I I I I I I I I I 

514 SPPKKQRVEDTTLPRSAAPANASARSSSAHAWPHAQSKKFKLAQTMSLMKKPATPRKVRD 



453 
420 
513 
480 
573 



481 T S I Q P S VKL YD S EL YMQTC I N P D P FAATTT I D P FLAS TMYLDEQAVDRHQAD FKKWLNAL 540 



814 



7 81 LDEFDYAFNNLAVDLRDGVRLTRWEVILLRDDLTRQLRVPAISRLQRIFNVKLALGALG 

M | I I II I I I I I II II I M : I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I M 

874 LDEFDYAFNNLAVDLRDGVRLTRVMEVILLRDDLTRQLRVPAI SRLQRI FNVKLALGALG 

841 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 

MM || IIIMI lllllllll I 1 I I IIMIIIII III I I M I IIIIMIIIIIII 

934 EANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAAATVLQKWWRRHWLHWIQ 



873 
840 
933 
900 
993 



901 RRI RHKELMRRHRAATVI QAVFRGHQMRKYVKLFKTERTQAAI I LQKFTRRYLAQKQLYQ 



I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I 11 I I I i M 1 I I I I I I I I M M I i 
994 RRIRHKELMRRHRAATVIQAVFRGHQMRKYVKLFKTERTQAAIILQKFTRRYLAQKQLYQ 1053 

961 SYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRRLFAKKLLAAAETARLQRS 1020 
| | | | | | | 1 | | I I I I I I I I I I I i I I I I I I I I I I I I I i I I I M I I I I I I I I I I I I M I M I I 
1054 SYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRRLFAKKLLAAAETARLQRS 1113 

1021 QKQQAAASYIQMQWRTYQLGRIQRHEFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 1080 

| | | | I I 1 I I M 1 I I 1 : I I I I ! I M :| I I I II I I I I I I I I I M I M M I I II II 

1114 QKQQAAASYIQMQWRSYQLGRIQRQQFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRA 1173 

1081 AINI QQRWRAKLSMRKCNADYLALRS SVLKVQAYRKATIQMRI DRNHYYSLRKNVI CLQQ 1140 

| | | | | | | | I I I I I I I I M I I I I I I I I I I I M I I I I I II I I 

1174 AIN I QQRWRAKLSMRKCNADYLALRS S VLKVQAYRKAT I QMRI DRNHYYSLRKNVI CLQQ 1233 

1141 RLRAIMKMREQRENYLRLRNASILVQKRYRMRQQMIQDRNAYLRTRKCIINVQRRWRATL 1200 

M | | | M | I M | 11 I 1 M 1 I 1 I 1 1 1 1 I 1 I I 1 1 1 I I 11 1 M I I I I I M 1 I I I I M I I I I I 1 
1234 RLRAIMKMREQRENYLRLRNASILVQKRYRMRQQMIQDRNAYLRTRKCIINVQRRWRATL 1293 

1201 QMRRERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 1260 

| | I M 1 I I I I 1 I I 1 I I I 1 I I I I 1 I I I I 1 I M I I I I 1 I I I 1 Ml MINN " 

1294 QMRRERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKER 1353 



1261 
1354 
1321 
1414 
1381 
1474 
1441 
1534 
1501 
1594 



1320 
1413 



QEYLHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFLQL 

I || | | | | M || I I I I I M I I I I I I I I II I I I M I I II I II I I I I I I I I I I I I I M 

QEYLHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFLQL 
RQAAITLQRRYRARLNMIKQLKSYAQLKQAAITIQTRYRAKKAMQKQWLYQKQREAIIK 1380 

IIIIIIIIMIIIIIIMIIIIIIMIIMMIIIIIIIIIIIIIIIIIIIIMMIIM 

RQ AAI T LQ RRY RARLNM I KQ L K S YAQ L KQAAI T I QT RY RAKKAMQ KQ WL YQ KQ REAI IK 1473 



VQ RRY RGN L EMRKQ I EVYQKQRQAVI RLQKWWRS I RDMRLCKAGYRRI RLS S LS I QRKWR 

I M | | | | | | | M I I I I I II I II I I I M M I I I I I II I I M I I I I I I I I I M I I I II I I I I 

VQRRYRGNLEMRKQI EVYQKQRQAVI RLQKWWRS I RDMRLCKAGYRRI RLS S LS I QRKWR 

ATVQARRQRE I FL S T I RKVRLMQAFI RAT LLMRQQRRE FEMKRRAAWI QRRFRARCAML 
M 1 1 I I 1 11 I 11 I I 1 1 I I I 1 I 1 I I I I 1 M M 1 I I I I I 1 I I I I 1 I I M I I I M I I I I 1 I I I 
ATVQARRQRE I FL S T I RKVRLMQAFI RATLLMRQQRRE FEMKRRAAWI QRRFRARCAML 

KARQDYQLIQS SVI LVQRKFRANRSMKQARQEFVQLRTIAVHLQQKFRGKRLMI EQRNCF 

I I I I I | I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I M I I I I I I 

KARQDYQLIQS SVI LVQRKFRANRSMKQARQEFVQLRT I AVHLQQKFRGKRLMI EQRNCF 



1561 QLLRCSMPGFQARARGFMARKRFQALMTPEMMDLIRQKRAAKVIQRYWRGYLIRRRQKHQ 

| | M I I I I I II I I I I M I I I I M I I I I I I I I I I I I II I M I I I I I I I I I M I I I I 

1654 QLLRC SMPGFQARARGFMARKRFQALMT P EMMDLI RQKRAAKVI QRYWRGYL I RRRQKHQ 

1621 GLLDIRKRIAQLRQEAKAVNSVRCKVQEAVRFLRGRFIASDALAVLSQLDRLSRTVPHLL 

| | M I 1 I I 1 I 1 1 I M 1 I 1 I I M I I M I I II I I 1 I I I I I I 1 M I ' ,hl I I I I I I 1 I I I 
1714 GLLDI RKRI AQLRQEAKAVN S VRCKVQEAVRFLRGRFI AS DALAVLS RLDRLS RTVPHLL 



1681 MWCSEFMSTFCYGIMAQAIRSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQM 

I M I I I I ! I I I I t I I I I I 1 M 1 I I M I I I I I I I 1 I I I 1 I I I M I I I LI 1 ILLi_ l _LIiIJ 

1774 



1440 
1533 
1500 
1593 
1560 
1653 
1620 
1713 
1680 
1773 
1740 



MWCSEFMSTFCYGIMAQAIRSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQM 1833 



1741 LLRWCDKDSEIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1800 
| | | | | | || || || | || || I || I I I I II I I I M I I I M I I M I I I I I I I I M II I I I I I I I I 



Db 



1834 LLRWCDKDSEIFNTLCTLIWFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQ 1893 



Qy 1801 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1860 

| | | | | | | | I I I 1 I I I I I I I I I I I I I I I I I I I 1 I I I I I I M I I M I I I I I I I I I I I M 1 I 1 
Db 1894 NARKPPPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1953 

Qy 1861 F 1861 

i 

Db 1954 F 1954 



RESULT 2 
Q7QAG9 

ID Q7QAG9 PRELIMINARY; PRT; 1538 AA. 

AC Q7QAG9; 

DT * 01-MAR-2004 (TrEMBLrel. 26, Created) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last sequence update) 

DT- 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE AgCP7874 (Fragment) . 

GN Name=agCG45985; ORFNames=ENSANGG00000018773; 

OS Anopheles gambiae str. PEST. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles. 

OX NCBI_TaxID=180454; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-PEST; 

RA Anopheles Genome Sequencing Consortium; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/DDB J databases. 

CC -'- CAUTION: The sequence shown here is derived from an 

CC ' EMBL/ GenBank/DDB J whole genome shotgun (WGS) entry which is 

CC preliminary data. 

DR EMBL; AAAB01008888 ; EAA08939.1; 

DR InterPro; IPR001715; Calponin-like . 

DR InterPro; IPR000048; IQ_region. 

DR Pfam; PF00307; CH; 1. 

DR Pfam; PF00612; IQ; 31. 

DR PROSITE; PS50021; CH; 1. 

DR PROSITE; PS50096; IQ; 6. 

FT NON_TER 1 1 

FT NON TER 1538 1538 

SQ SEQUENCE ■ 1538 AA; 183217 MW; 0604DF326B743CC1 CRC64; 

Query Match 32.5%; Score 3095; DB 2; Length 1538; 

Best Local Similarity 41.8%; Pred. No. 5e-155; 

Matches 652; Conservative 298; Mismatches 425; Indels 184; Gaps 15 

Ov 4 63 AQTMS LMKKPAT P RKVRDT S I Q P S VKL YD S EL YMQT CI N P DP FAATTT I D P FLAS TMYLD 522 

^ Y . . . . | i | : | | : | : I Mil: : : : I I I I I I I M I M : I I I I I I I I 

3 SRSLSL-KRTAVPCSLPPKSEEKRVFLYDSDRHLKTLINPDPFAATTTCNPFLTVTMYLD 61 

Ov 523 EQAVDRHQADFKKWLNALVSIPADLDADLNNKIDVGKLFNEVRNKELWAPTKEEQSMNY 582 

| : | j : : : I I I I I I I h I I I I I I : I : I I I I I I : I I : *• I I I : I I II I I I 
Db 62 ERAFEQYERQMKKWLNALVTIPADLDTEPNKPLDVGKLFDEVKSKELTLAPTKELISSKY 121 

ov 583 LTKYRLETLRKAAVELFFSEQMRLPCSKVAVYVNKQALRIRSDRNLHLDWMQRTILELL 642 

yY in | | | : | : | | : : : I III : I I I : I : I I I I I I I : I : I I I I I I I 



Db 



Db 122 Y-KTRLNHLRSAGIALYTSEEIAMPLRKVAAQIEKQLLSLRTDRNLHLDLVLQRSILELL 180 

Qy 643 LCFNPLWLRLGLEWFGEKIQMQSNRDIVGLSTFILNRLFRNKCEEQRYSKAYTLTEEYA 702 

Db 181 LCFNPLWLRLGLEWFGEQIELQSNRDIVGLSTFIIHRLFRDRYLEARNSKAYNLSRAYA 240 

Qy 703 ETIKKHSLQKILFLLPFLDQAKQKRIVKHNPCLFVKKSPHKETKDILLRFSSELLANIGD 762 

| ::| :|: :|||l III I I : : : : : I II I I I I I : : I I I I I I : I I : I I : h I : : I I I 
Db 241 EHMRKFTLRMVLFLLLFLDTAKRRKLIKHNPCLFVRNAPHKETKEILIRFASQLVSGIGD 300 

Qy 763 ITRELRRLGYVLQHRQTFLDEFDYAFNNLAVDLRDGVRLTRWEVILLRDDLTRQLRVPA 822 

||: ::|:|||| | : | : I I I I : : I I I I I I I I I I I I I I I I I I : I : I I I I I I I : INI 
Db 301 ITKHMKRVGYVLSHKQSFLDEYNYAFENLAVDLRDGVRLTRVMEIILLRDDLSASLRVPP 360 

Qv 823 ISRLQRIFNVKLALGALGEANFQLGGDIAAQDIVDGHREKTLSLLWQLIYKFRSPKFHAA 882 

|||||:M:IIIM :h": l = : 1 = 11 I I I I I : I : I I I I I :: I I I I : I I I : I I 
Db 361 ISRLQKIHNINLALVALEQADYKIAGNVTAKDICDGHREQTMSLLWQIVYKFRAPKFNAA 420 

Qy 883 ATVLQKWWRRHWLHWIQRRIRHKELMRRHRAATVIQAVFRGHQMRKYVKLFKTERTQAA 942 

| | | | : | | | : I I I I I M I Ml M III ||: :| : : : :: :| 
Db 421 AIVLQRWWR>INWLKVTISRRIEEKRALRREAAARTIQAAVRGYCVRVWYEAHRRQKLRAI 480 

0v 943 hlqkfTRRYLAQKQLYQSYHSIITIQRWWRAQQLGRQHRQRFVELREAAIFLQRIWRRR 1002 

: : | : | : I I I II I I : : : I : I I : I I I : I I I : I I : I : I I I I : I I 
Db 481 VTIQRFSRRYLAQKLAARRFSAIVRIQQWWRTVRQMRQARERFLLCRKSAIVLQTSYRRY 540 

1003 LFAKKLLAAAE TARLQRSQKQ 1023 

: I I I I I I I : : I : I 

541 ALGRKLLAAATLIGQIRAEAKHRHLQATIIQRSIKSYVIHRRLHATVNGMVAFIRRKRLQ 600 

Ov 1024 QAAASYIQMQWRTYQLGRIQRHEFLRQRDLIMFVQRRMRSKWSMLEQRKEFQQLKRAAIN 1083 

:|: II I :U I = =IM I : I I = = I I 

Db 601 NRSAAKIQ AYQRMRIVRKEYLRSRSAAICIQRRWRECMEARQLRNRFLLMRASAIR 656 

Ov 1084 IQQRWRAKLSMRKCNADYLALRSSVLKVQAYRKATIQMRIDRNHYYSLRKNVICLQQRLR 1143 

Y : | | : : | | | : | I : : : : I I : I : I I : I : I : I I : I '1:1 I 

Db 657 LQQQYRGWRQMRQDRHTYANARNLIVQVQRRWRGTLAMRKERANYRTLRRVTINVQRRFR 716 

Ov 1144 AIMKMREQREN YLRLRNAS I LVQKRYRMRQQMI QDRNAYLRTRKC I INVQRRWRATLQMR 1203 

yY | |: : | | | I:: :|:|:| : I:: I I I : llll:ll I U 

Db 717 ARQAMQSEVERYRTLCKATVTLQQRFRANKAMMEQRQQYNSLRVATLCVQRRFRAQLSMR 776 

Ov 1204 RERKNYLHLQTTTKRIQIKFRAKREMKKQRAEFLQLKKVTLWQKRRRALLQMRKERQEY 1263 

| s| :: II ::M I : I I : I : : I : I I I I I : I I = I I 

Db 777 AARAS YAKVRCAI LT I Q S Q YRAT LAMRHARDRFVT LRRCT I T VQARFRAI LAG RAAKQRY 836 

Ov 1264 LHLREVTIKLQRRFHAQKSMRFMRAKYRGTQAAVSCLQMHWRNHLLRKRERNSFL 1318 

:|: |: :||:: I II :h I I::: I: :l 

Db 837 ESIRKATLHIQRKWRATLEMRQVRSHYRRQCNAALTLQRSWRGVLLQRKFRHDYLLYRGA 896 

1319 QLRQAAITLQRRYRARLNMI KQLKSYAQLKQAAITIQT 1356 



Qy 

Db 



Qy 



Qy 

Db 



I 11:1:111 II I I : I : : M : I 



Db 897 ATVLQRRYRALVQGRMVRREMQHCRWAAVTIQRRLRATLQMNRDRKAFLQLRQSVLWQR 956 



1357 RYRAKKAMQKQWLY QKQR EAI I KVQRRYRGNL 1389 

| • | | : | : I I I : : I I I :: I I I I I 

957 RFRANRACRVQRVQYAALKRSAITI SHRWAATLHMRQQRS DFLRLKSATWMQRRYRAQR 1016 



Ov 1390 EMRKQI EVYQKQRQAVI RLQKWWRS IRDMRLCKAGYRRI RLS S LS IQRKWRATVQARRQR 1449 

:: :: |:: I |:: II: :|: I I I: : :: :h :l =1 ' ' 

Db 1017 AKQQAVQQYERMRAAIVLLQRKYRAQRAMEKCRGRFLNLKSASIWQEFYRGYRNMRHDR 1076 

Ov 1450 EIFLSTIRKVRLMQAFIRATLLMRQQRREFEMKRRAAWIQRRFRARCAMLKARQDYQLI 1509 

|: | : | | II II hi I 1:1 I: :ll II Ihl I: : : 

Db 1077 AAFIRLRESVLAIQRRFRGKLLTRQTWEYERKRKAAITLQRWFRGHRAMVKERKRFTAL 1136 

QV 1510 QSSVILVQRKFRANR SMKQARQEFVQ LRT 1538 

: : : : | : : I I I ||:: I : I =11 

Db 1137 KHAATVIQARYRARRAMLHDAAQYGRIYRAVTTIQIHWKATLSMRRERDRYQQTLRAVRT 1196 

Ov 1539 IAVH LQQKFRGKRLMI EQRNC FQLLRCSMPGFQ 1571 

UY | : | :|:::| I II 

Db 119 7 IQIHYRAYRERLIDEANYRIYRSAVIWQRRYREKL QTRCESLRFEQIRRTVRGLQ 1252 

Qy 1572 ARARGFMARKRFQALMT PEMMDLI RQKRAAKVIQRYWRGYLI RRRQKHQGLLDI RKRIAQ 1631 

|| :||: I 11:111 :: 11 = : I I II =111 1 = 1 = : I " 
Db 1253 TYGRGVLARRAFLALLTPEYLERKRQQKAALRIQAWWRGAYHRKRYQTMQMRKIAQQMVA 1312 

Ov 1632 LRQEAKAVNSVRCK— VQEAVRFLRGRFIASDALAVLSQLDRLSRTVPHLLMWCSEFMST 1689 

Y | |s ::| : : I I I : I I : I : I : : I : I : I : M I I I I I I : hi 

Db 1313 SRMAARRDPTIRLSNVSRLCLRFLKTRFSSSEAIGILKRLERMSRLVPHLLMEDAVFLSV 1372 

Ov 1690 FCYGIMAQAIRSEVDKQLIERCSRIILNLARYNSTTVNTFQEGGLVTIAQMLLRWCDKDS 174 9 

Ml : | I I I I I I I I I I IN I : I I I I I I I I : I I I I I I I I :: I I I I I M I I I 
Db 1373 FCYNMMAQAI RSEVDKI LI EICARI I LNLARFRGTKEQAFQEDGLVTVSQMLLRWCDKDC 1432 

Qv 1750 EIFNTLCTLIWVFAHCPKKRKIIHDYMTNPEAIYMVRETKKLVARKEKMKQNARKP 1805 

| | : | | | | | : | | | I II: I I I :: I I I I : I I I I I I I I I I I I " I - I 
Db 1433 GIFSTLCTLLWVLAHDNKKKNAIRRYMISKDAIYMLRETKKLVQRKEKMRKNVQRPVGCL 1492 

0v i 806 PPMTSGRYKSQKINFTPCSLPSLEPDFGIIRYSPYTFISSVYAFDTILCKLQIDM 1860 

Y | : : : I I I I I I I I : I I I I I I I : I : I \--\- 

Db 1493 VAPNPQLMR TVP S LEP D FGVNRS KP YVFYS S VFGFERVLQMLEVDL 1538 



